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Abstract — This paper considers pilot-based channel estimation 
in large-scale multiple-input multiple-output (MIMO) communi- 
cation systems, also known as "massive MIMO". Unlike previous 
works on this topic, which mainly considered the impact of 
inter-cell disturbance due to pilot reuse (so-called pilot contam- 
ination), we are concerned with the computational complexity. 
The conventional minimum mean square error (MMSE) and 
minimum variance unbiased (MVU) channel estimators rely on 
inverting covariance matrices, which has cubic complexity in the 
multiplication of number of antennas at each side. Since this 
is extremely expensive when there are hundreds of antennas, 
we propose to approximate the inversion by an L-order matrix 
polynomial. A set of low-complexity Bayesian channel estimators, 
coined Polynomial ExpAnsion CHannel (PEACH) estimators, are 
introduced. The coefficients of the polynomials are optimized 
to yield small mean square error (MSE). We show numerically 
that near-optimal performance is achieved with low polynomial 
orders. In practice, the order L can be selected to balance 
between complexity and MSE. Interestingly, pilot contamination 
is beneficial to the PEACH estimators in the sense that smaller 
L can be used to achieve near-optimal MSEs. 

I. Introduction 

MIMO techniques can bring substantial improvements in 
spectral efficiency to wireless systems, by increasing the 
spatial reuse. While 8x8 MIMO transmission has found its 
way into standards such as LTE- Advanced [1], there is an 
increasing interest in equipping base stations with much larger 
antenna arrays J2)-S. Such large-scale MIMO, or "massive 
MIMO", techniques can give unprecedented spatial resolution, 
enabling a very dense spatial reuse that potentially can keep up 
with the rapidly increasing demand for wireless connectivity. 

A major limiting factor in large-scale MIMO is the availabil- 
ity of accurate channel state information (CSI). This is since 
high spatial resolution can only be exploited if the propagation 
environment is precisely known. CSI is typically acquired by 
sending pilot signals and estimating the channel coefficients 
from the received signals [5|-[10|. The Bayesian MMSE 
estimator can be applied [7|-[10| if the channel statistics are 
known, while the MVU estimator is applied otherwise Q. 

These channel estimators basically solve a linear system of 
equations, or equivalently multiply the received signal with an 
inverse of the covariance matrices, which is a mathematical 
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operation with cubic computational complexity. Therefore, it 
is very computationally expensive to compute the MMSE 
and MVU estimates in large-scale MIMO systems. The high 
complexity can be avoided under ideal propagation conditions 
where all covariance matrices are diagonal, but large-scale 
MIMO channels typically have a distinct spatial correla- 
tion due to insufficient antenna spacing and richness of the 
propagation environment |0]. Moreover, the necessary pilot 
reuse in cellular networks creates spatially correlated inter- 
cell interference, known as pilot contamination, that reduces 
the estimation performance and the spectral efficiency @-|[6l. 

Similar complexity issues appear in multiuser detection, 
where both the decorrelating detector and the linear MMSE 
detector involve matrix inversions ifTTI . A common low- 
complexity approach is reduced-rank filtering [12]. This can 
be achieved by polynomial expansion (PE), where the matrix 
inverse is approximated by an L-order matrix polynomial 
[ 1 1 ]— [ 14]. PE-based detectors are versatile since the structure 
enables simple multistage hardware implementation [11] and 
the order L needs not to scale with the system size to achieve 
near-optimal performance [12]. Therefore, L is simply selected 
to balance between complexity and detection performance. A 
main problem is to select the coefficients of the polynomial 
to achieve high performance at small L; the optimal weights 
are expensive to compute |jTT|. but alternatives based on 
appropriate scalings [13] and asymptotic analysis lfl4l exist. 

Inspired by the prior works in detection, in this paper, 
we propose a set of low-complexity channel estimators that 
we call Polynomial ExpAnsion CHannel (PEACH) estimators. 
These novel estimators approximate the MMSE estimator by 
replacing the matrix inversion with a polynomial expansion. 
The coefficients of the polynomial are optimized to yield 
minimal MSE at any fixed polynomial order L, while keeping 
the low complexity. The PEACH estimators are evaluated 
under different propagation/interference conditions and show 
remarkably good performance at low polynomial orders. 

Notation: Boldface (lower case) is used for column vectors, 
x, and (upper case) for matrices, X. Let X T , X ff , and X -1 
denote the transpose, the conjugate transpose, and inverse of 
X, respectively. The Kronecker product of X and Y is denoted 
X<8> Y, vec(X) is the vector obtained by stacking the columns 
of X, tr(X) denotes the trace, and ||X||f is the Frobenius 
norm. The notation = denote definitions, while 0(M X ) means 
that the complexity is bounded by CM x for some C > 0. 
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Fig. 1 . Illustration of pilot signaling in a large-scale Nt X N r MIMO system, 
where N r ^S> Nt ■ The complexity of conventional channel estimators is very 
large in these systems, which calls for low-complexity alternatives. 



II. Problem Formulation 

This paper revisits the problem of estimating a quasi- 
static flat-fading MIMO channel H £ C NrXNt , where N t is 
the number of transmit antennas and N r is the number of 
receive antennas. Similar to 0-00], the transmitter sends a 
predefined pilot matrix P £ C Nt x B over B > 1 symbol times; 
see Fig. [T] The receiver tries to obtain H from the received 
signal Y <E <C NrXB , given by 



HP + N 



(1) 



where the disturbance N £ <C NrXB is assumed to be 
circularly-symmetric complex Gaussian noise: vec(N) ~ 
CN~(0, S). The disturbance covariance matrix S £ £ N - BxN - B 
is positive definite and can include both regular uncorrelated 
receiver noise and different types of interference from other 
systems. The analysis herein holds for any S, but some typical 
special cases will be considered in Section IIV-BI 

The channel matrix is modeled as Rayleigh fading with 
vec(H) ~ CjV(0, R), where the channel covariance matrix 
R £ c N tN r xN t N r j s positive semi-definite. Observe that R is 
generally not a scaled identity matrix, but describes the spatial 
propagation environment. 

If the channel statistics are known at the receiver, the 
Bayesian MMSE estimator of the MIMO channel is Q-Qjl 



RP H ( PRP 



>H 



S) vec(Y) (2) 



vcc(Hmmse) 

where P = (P T <g) I). The performance is measured in MSE = 
E{||H - HmmseIH} - tr ((R- 1 + P^S- 1 ?)- 1 ). 

Alternatively, if the channel distribution is unknown to the 
receiver, the classic MVU estimator is [7. Chapter 4] 

vec(H M vu) = (f^S- 1 ?) ~* P H S- 1 vcc(Y). (3) 

The performance measure is then the estimation variance 
E{||H - HmvuIH} = tr ((P^S^P)- 1 ). Obviously, 



tr (R- 1 + P^S- 1 ?)" 1 < tr (P^S^P) 



(4) 



for any R/O, thus the MMSE estimator achieves a better 
average estimation performance than the MVU estimator since 
it utilizes the channel statistics. 

A. Complexity Issues in Large-Scale MIMO Systems 

The main computational complexity when computing the 
MMSE and MVU estimators in © and © lies in solving a 
linear system of equations or, equivalently, in computing the 
matrix inversions directly!^ Both approaches have computa- 
tional complexities that scale as 0(M 3 ) where M = NtN r JQ 
This complexity is relatively modest in classic MIMO systems 
where 2 x 2, 4 x 4, or 8 x 8 are typical configurations. 

Recently, there is an increasing interest in large-scale MIMO 
systems where there might be hundreds of antennas at one 
side of the link ©-Hi. To excite all channel dimensions, the 
pilot length B should be of the same order as Nt. Large-scale 
systems are therefore envisioned to exploit channel reciprocity 
to always have Nt < N r in the channel estimation phase — 
N r can even be orders of magnitude larger than Nt without 
degrading the estimation performance per antenna element. 

Observe that in a potential future large-scale MIMO system 
with N r = 200 and N t = 20, the MMSE and MVU estimators 
would require inverting matrices of size 4000 x 4000 (or sim- 
ilarly, solving a linear system of equation with 4000 unknown 
variables). The purpose of this paper is to develop alternative 
channel estimators that allow for balancing between compu- 
tational/hardware complexity and estimation performance. 

Remark 1. While having Gaussian channels and disturbance 
is a well-accepted assumption in conventional MIMO systems, 
the channel modeling for large-scale MIMO is still in its 
infancy. By increasing the number of antennas we will improve 
the spatial resolution of the array which eventually may 
invalidate the rich-scattering assumption that is behind the use 
of Gaussian channel distributions [4 J. However, we stress that 
the estimators in this paper can be applied and give reasonable 
performance under any channel and disturbance distributions; 
this is since © is also the linear MMSE estimator and © is 
the best linear unbiased estimator (BLUE) in cases when only 
the first two moments of H and/or N are known [Tj, MOV . 

III. Low-Complexity Bayesian PEACH Estimators 

In this section, we propose several low-complexity Bayesian 
channel estimators based on the concept of polynomial expan- 
sion. To understand the main idea, we first state the following 
lemma which is easily proved by using standard Taylor series. 

'if R, S, P are all diagonal matrices, the complexity can be greatly reduced 
by simply inverting the diagonal elements. However, such special cases are of 
limited practical interest, particularly in large-scale MIMO systems which are 
prone to non-negligible spatial channel correlation and pilot contamination. 

2 Note that 0(M 3 ) refers to the complexity scaling of the classical Gaussian 
elimination algorithm. The complexity is reduced to C(M 2,8074 ) by the more 
sophisticated Strassen's algorithm. The exponent can be further reduced, see 
e.g. 1 15 1, but mainly for academic purposes since extremely large matrices 
are required to actually benefit from such improved asymptotic behaviors. 



Lemma 1. For any Hermitian matrix X e C NxN , with 
bounded eigenvalues \ X n (X) | < 1 for all n, it holds that 



(i-xr^^rx*. 



(5) 



;=o 



Observe that the impact of X' in (f5]l reduces with I (as 
A n (X) z for each eigenvalue). It therefore makes sense to 
consider L-order polynomial expansions of the matrix inverse 
using only the terms I = 0, . . . , L. In principle, the inverse 
of each eigenvalue is then approximated by an L-order Taylor 
polynomial, thus L needs not scale with the matrix dimension 
to achieve a certain accuracy per element. Instead, L can be 
selected to balance low approximation error with complexity. 

In order to apply Lemma Q] on matrices with any eigenvalue 
structure, we obtain the next result which is similar to ifPJI . 

Proposition 1. For any positive-definite Hermitian matrix X, 

L 

X- 1 =a(l-(I-aX))~ 1 sia^(I-aX) 1 (6) 

1=0 

where the approximation holds with equality when L —¥ oo if 
a is selected to satisfy < a < . ,„- . 

JJ max™ X n (.X.) 

A. Unweighted PEACH Estimator 

Applying the approximation in Proposition Q~|on the MMSE 
estimator in (O gives the low-complexity L-order Polynomial 
ExpAnsion CHannel (PEACH) estimator 

L 

vec(H PEACH ) = RP ff J2 a ( X ~ «(P R P H + S))'vec(Y) 

1=0 

(7) 
which does not involve any channel inversions. The compu- 
tational complexity of © is 0{LM 2 ) where M = N t N r . 
Whenever L <C M, this is a large complexity reduction as 
compared to 0(M 3 ) for the original MMSE estimator. 

Theorem 1. The PEACH estimator in © achieves the MSE 

tr (r + RP H A L (PRP H + S)Af PR 2RP H A L PR 

where A L = £f =0 a(l - a(PRP fl + S))'. 

Proof: Follows from direct computation of the MSE. ■ 
The PEACH estimator lends itself to efficient multistage 
hardware implementation where (-) i+1 vec(Y) is computed 
from (•) i vec(Y) by multiplying the latter vector with (i — 
a(PRP fl +S)). The hardware can be similar to the multistage 
detection implementation illustrated in [LL Fig. 1]. 

It remains to select the scaling parameter a to satisfy the 
condition in Proposition Q] From a complexity point of view, 
we easily can select a to be equal to 



8) 



other hand, we are also interested in choosing a to achieve fast 
convergence in the polynomial expansion. Among the values 
that satisfy the condition in Proposition Q] the choice 

2 

aPEACH = "~ ~-~ T ; 7= ~— " 

max„ A n (PRP ff + S) + min n A„(PRP ff + S) 

(9) 



will maximize the asymptotic convergence since the largest 
and smallest eigenvalue of (I — q(PRP h + S)) become 
symmetric around the origin liT3l . Although the computation 
of the extreme eigenvalues is generally quite expensive, we 
note that these can be deduced from the setup; that is, path- 
loss, antenna array design, location of dominating interferers, 
receiver noise, etc. We therefore assume that «peach can be 
computed with low complexity in this paper and refer to [13] 
for general techniques for efficient computation of eigenvalues. 

B. Weighted PEACH Estimator 

Although the PEACH estimator (0 converges to the MMSE 
estimator as L — >• oo, it is generally not the best L-order 
polynomial estimator at finite L. More specifically, instead 
of multiplying each term in the sum with a, we can assign 
different weights and optimize these for the specific order L. 
In this way, we obtain the weighted PEACH estimator 

L 

vcc(H W -peach) = RP 



H 



1=0 



wia l+1 (PKP H + S) vec(Y) 

(10) 

where w = [wq, . . . , wl] T are scalar weighting coefficients]^ 
Observe that the a-parameter is now redundant and can be set 
to one. For numerical reasons, it might still be good to select 

aW-PEACH < -~ — ^— (11) 

max„A„(PRP i/ + S) 

since this prevent the eigenvalues of a l+1 (PRP H + S) to 
grow unboundedly as I becomes large. This will simplify 
the implementation of the following theorem, which finds the 
weights that minimize the MSE. 

Theorem 2. The MSE E{||H — Hw-peach|||'} is minimized 

by 
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° pt ] T = A- X b 



(12) 



where the ijth element of A E i^L+ixL+i anfl p tne ^ n e [ emen i 
of he C L+1 are 



[A]y =a l+j tr(RP H ( 
(RP ff ( 



PRP 



H 



sy+j-i-pR) , 



a'tr 



PRP 



H 



\i— 1 



PR 



The resulting MSE of the W-PEACH estimator is 



tr(R) 



W ^t Aw o 



,H 



opt-™ "opt b W opt "opt" 



(13) 



(14) 



Proof: Follows from differentiation of the MSE. ■ 

Although Theorem |2] provides the optimal weights, the 
computational complexity is 0(M 3 ) since it involves pure 
matrix multiplications of the form Z\ This means that com- 
puting the optimal weights for the W-PEACH estimator has the 
same asymptotic complexity scaling as computing the original 
MMSE estimator. To benefit from the weighting we thus need 
to find a low-complexity approach to compute the weights. 

3 W-PEACH is obtained by expanding each (I - a(PRP fl + S))' as a 
binomial series, collecting terms, and replacing constant factors with weights. 



C. Low-Complexity Weights 

Next, we propose a low-complexity algorithm to compute 
weights for the W-PEACH estimator. We will exploit that 



(PRP ff + S) = E{vec(Y)vec(Y)" } = lim - ]T y t yf , 

t=l 

(15) 
where y< = vec(Y) denotes the received signal at estimation 
time instant t. This means that the matrix (PRP ff + S) 
can be closely approximated by the sample covariance matrix 
Y X)t=i y*y? if me number of samples T is large. 
For any fixed T > 1 and i > 1, we now observe that 

tr (RP H (PRP ff + S)*PR) (16) 

« tr Up ff (pRp ff + s)*- 1 f - Y, y*yf) p r ) ( 17 > 



±£y? (PR^PRP^ + Sr 1 )^. 



/=i 



(18) 



Since the elements of A and b in ( T3"3| ) are of the form 
in ( fToT l, we can approximate each element using (fT8lrl By 
computing/updating these approximations over a sliding time 
window of length T, we obtain Algorithm Q] At any time 
instant t, this algorithm computes approximations of A, b, de- 
noted by A t , b t , by using the received signals y t , . . . , yt-T+i- 
These are used to compute approximate weights w approxt . 
To reduce the amount of computations, A t , h t are obtained 
from At_i, bt i by adding one term per element based on the 
current received signal yt and removing the impact of the old 
received signal yt-T (which is now outside the time window). 
The algorithm can be initialized in any way; for example, by 
accumulating T received signals to fill the time window. 

The complexity of computing the elements in A t ,b t is 
0(L 2 M 2 ) per time instant since there are (L + 1)(L + 1) 
elements in A 4 and L + 1 elements in b t to be computed, 
which results in (L + 2)(L + 1) elements in total, and 
we need to compute a series of multiplications between 
vectors and matrices. Furthermore, w approXit is obtained by 
solving an i-dimensional system of equations, which has 
complexity 0(L 3 ). To summarize, the W-PEACH estimator 
along with Algorithm Q] has a computational complexity of 
0(L 2 M 2 + LAI 2 + L 3 ) = 0{L 2 M 2 + L 3 ), which is smaller 
than C(M 3 ) of the MMSE estimator for L < </M = VWJ^. 

One additional feature of Algorithm[T]is that it can easily be 
extended to practical scenarios where also the true covariance 
matrices R and (PRP ff + S) are approximated by sample 
covariance matrices. This would enable adaptive tracking of 
the slow variations in channel and disturbance statistics that 
appear in practice. Due to space limitations, we leave this 
extension for future work. 

4 Note that bo = tr(PR 2 P H ) needs to be treated differently since there is 
no (PRP fl + S) term. For some predefined vectors Vj ~C/V(0, 1), Algo- 
rithm[T]uses the approximation tr(PR 2 P H ) w f ^T =l vfP~R 2 P H Vi. 



Algorithm Is Low-complexity weights for W-PEACH 

Input: Polynomial order L and time window T; 

Input: Current time t; 

Input: New and old received signals yt,yt-T', 

Input: Approximations A t _i,b t _i at previous time t— 1; 

l Set [A t ]ij = [At-i]ij 

" ^-yf (pR 2 P ir (PRP fl + S) J+J - 2 ) y t 



T 



yf_ T (pR 2 P ff (PRP H + S) i+ ^ 2 ) y t -T Vt, j 

2 Set [b t ]i = [b t _i]i 

+ ^yf (pR 2 P ff (PRP ff + Sr 2 )y t 

" %y"-T (PR 2 P H (PRP^ + S) 4 - 2 ) yt-T Vz > 2 

3 Set [bt]i = f ElivfPR 2 P ff Vi for v.^CAA(0,I); 

4 Compute w approx , 4 = A t _1 b f ; 

Output: Approximate weights w approxt at time i; 



D. Summary of Computational Complexity 

The complexity of the conventional estimators and proposed 
PEACH estimators are summarized as follows: 



Channel Estimators 


Computational Complexity 


MMSE and MVU 


(D(N?N?) 


PEACH 


0{LN'tN' 2 ) 


W-PEACH 


0(L 2 N 2 N 2 + L A ) 



We note that the cubic complexity scaling in N t N r for the 
conventional MMSE and MVU estimators have been reduced 
to squared complexity for the proposed PEACH estimators. 
The order L of the polynomial expansion has a clear impact 
on the complexity, but recall that it generally needs not scale 
with N t N r |[T2l . In the next section, we will illustrate that 
small values on L yields good performance. 

IV. Numerical Evaluation 

In this section, we illustrate the performance of the proposed 
PEACH and W-PEACH estimators. The analysis so far has 
been generic with respect to the disturbance covariance matrix 
S. Here, we consider two scenarios: noise-limited and cellular 
networks with pilot contamination. We describe the latter 
scenario in more detail since it is one of the main challenges 
in the development of large-scale MIMO systems [4|. 

A. Noise-Limited Scenario 

A commonly studied scenario is when there is only uncor- 
rected receiver noise; thus S = cr 2 I where a 2 is the variance. 

B. Pilot Contamination Scenario 

A scenario that has received much attention in the large- 
scale MIMO literature is when there is disturbance from 



simultaneous reuse of pilot signals in neighboring cells J2J- 
0. Such reuse is often necessary due to the finite channel 
coherence time (i.e., the time that a channel estimate can be 
considered accurate), but leads to a special form of interference 
called pilot contamination. It can be modeled a£] 



N 



E HP 



N 



(19) 



iex 



where X is the set of interfering cells, H,; is the channel from 
the zth interfering cell to the receiver in the cell under study, 
and vcc(N) ~ CAf(0,a 2 T) is uncorrected receiver noise. If 
Hi is Rayleigh fading with vec(Hi) ~ £/V(0, £j), tnen 



S = ^P£ 2 P H +cr 2 I. 



(20) 

Note that only the sum covariance matrix Y^iez ^* needs to 
be known when computing the proposed PEACH estimators. 
When (ES is substituted into the PEACH and W-PEACH 
estimator expressions in (0 and (TTOb we get contaminated 
terms of the form RP^PSiP^. These terms are small if 
R and Sj have very different span, or if tr(Si) is weak 
altogether — this is easily observed if P H P is a scaled iden- 
tity matrix. Similar observations were recently made in the 
capacity analysis of 0. A similar criterion was used in [5| to 
obtain a pilot allocation algorithm that tries to minimize the 
impact of pilot contamination. Under certain conditions, the 
subspaces of the useful channel and pilot contamination can 
be made orthogonal by exploiting both received pilot and data 
signals for channel estimation, as proposed in J6). 

C. Numerical Examples 

To evaluate the performance of our proposed estimators, 
we consider a large-scale MIMO system with N T = 100 and 
N t — 10 antennas in addition to the pilot length B — 10. We 
follow the Kronecker model [16] to describe correlation among 
antennas of the desired and disturbance MIMO channels. 
In this model, the covariance matrix of a MIMO channel 
is modeled as R = R t <g> R r where R t £ C NtXNt and 



xv r 



->N r x N r 



are the spatial covariance matrices at the 



transmitter and receiver sides, respectively. Following the same 
modeling, we have Sj = X^ <£> S ri for i £ I. 

To generate covariance matrices, we use the exponential 
model irTTl . Without loss of generality, all the covariance 
matrices will have diagonal elements equal to one which 
results in tr(R) = tr(Sj) = N t N r . We assume that there 
are two dominating interfering cells, i = 1,2, for which the 
covariance matrices are weakened by the factor < /% < 1, 
i.e., ftSj. This factor represents how severe the pilot con- 
tamination part is: /?,; = represents the noise-limited case, 
while /3j = 1 represents the case when the useful channel and 
the zth interfering channel are equally strong. We define the 
normalized pilot signal-to-noise ratio (SNR) as 7 = Sf where 
Vt = -^~t r (PP ) is me average pilot power. 

5 Cell i can use an arbitrary pilot P;, but only pilots with overlapping span 
(i.e., PiP H 7^ 0) will cause interference. Therefore, the case of a common 
reused pilot P; = P is of main interest, while extensions are straightforward. 
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Fig. 2. MSE comparison of different estimators for different polynomial 
orders when j3 £ {0, 0.1}, which represents noise-limited and pilot contam- 
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Fig. 3. MSE comparison of different estimators for different values of 
the SNR 7 when 6 {0, 0.1}, which represents noise-limited and pilot 
contaminated scenarios. 



We use the normalized MSE, defined as 



MSE 

tr(R) ■ 



as perfor- 



mance measure. In all the figures, we compare the performance 
of the proposed estimators with the conventional MMSE and 
MVU estimators. The pilot matrix is P = \/Vt\. 

In Fig. 12 MSE has been plotted as a function of the 
polynomial order L. The noise-limited scenario is given by 
P = 0, while /3 = 0.1 (we assume that j3i — j3 2 = /3) 
represents the scenario where the two interfering cells have 
interfering channels which are 10 dB weaker than the desired 
channels. The SNR is 7 = 5 dB. As can be seen from 
Fig. |2 both PEACH and W-PEACH detract by increasing L 
and outperform MVU for L > 2. Interestingly, W-PEACH 
approaches the MSE-values of the MMSE estimator very 
quickly, while PEACH needs a higher L than W-PEACH to 
get close to the MMSE curves. 

In Fig. [3] we compare different estimators with or without 
additional interference from pilot contamination. We consider 
a fixed L — 10 and vary the SNR 7. As expected, the 
MSE of the MMSE and MVU estimators decay steeply to 
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Fig. 4. Comparison of W-PEACH estimator and Approximate W-PEACH 
estimator in a noise-limited scenario (J3 = 0) for different SNR 7 values. 



zero when the 7 increases in the noise-limited scenario, 
while the MSE saturates to a non-zero error floor under pilot 
contamination. The MSE floor when j3 ^ is due to the 
fact that the normalized signal-to-interference-and-noise ratio 
(SINR) becomes almost constant as 7 increases. To show this, 
we define the normalized SINR as 



7 



l + K/3-y 



(21) 



where K is the number of interferers. As 7 increases, the SINR 
in ( f2Tb approaches -^ > 0, then the MSE values, which 
are different functions of the SINR for different estimators, 
approaches some non-zero limits. 

We observe from Fig. [3] that pilot contamination does 
not make significant impact on the PEACH and W-PEACH 
estimators; in fact, pilot contamination is beneficial in the 
sense that it reduces the gap to the optimal MMSE estimator. 
This remarkable result is explained as follows. For any fixed 
L, PEACH and W-PEACH will converge to a non-zero MSE 
when 7 increases, due to the the bias generated by the 
approximation error. Since this also happens for the MMSE 
and MVU estimators under pilot contamination, the relative 
loss of using the proposed low-complexity estimators will be 
smaller. Consequently, we can reduce L as increases and 
still achieve near-optimal performance. 

Finally, we focus on the low-complexity approach in Al- 
gorithm Q] for finding the weights. Figs. |4] considers a noise- 
limited scenario and a time window of length T — 100. In this 
figure, we observe that the approximate W-PEACH estimator 
which exploits the approximate weights from Algorithm Q] 
gives almost identical performance as W-PEACH with optimal 
weights computed according to Theorem [2] This confirms that 
the W-PEACH estimator is indeed a low-complexity channel 
estimator suitable for large-scale MIMO systems. 

V. Conclusions 

Large-scale MIMO techniques offer a high spatial resolu- 
tion that can drastically increase the spectral and/or energy 
efficiency of wireless systems. Acquiring accurate CSI is 



a key to achieve these potential improvements in practice. 
Since the conventional pilot-based MMSE and MVU channel 
estimators have a computational complexity unsuitable for 
scenarios with large numbers of antennas, we proposed a set 
of low-complexity PEACH estimators. These are based on ap- 
proximating the inversion of covariance matrices in the MMSE 
estimator by an L-order matrix polynomial. The proposed 
estimators converge to the MMSE estimator as L grows large, 
but by deriving the optimal coefficients for the polynomial 
we can obtain near-optimal MSE performance at small values 
on L. In practice, the order L can be selected to balance 
between complexity and MSE performance. Numerical results 
are given for noise-limited scenarios as well as under pilot 
contamination from pilot reuse in adjacent systems. Although 
pilot contamination can create an MSE floor, it is actually 
beneficial to the proposed estimators in the sense that smaller 
L can be used to achieve good performance. 
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